NSF PAR Search | NSF Public Access Repository


General Syntheses of High-Performance Thermoelectric Nanostructured Solids without Post-Synthetic Ligand Stripping

https://doi.org/10.1021/acs.nanolett.3c01438

Lou, Yue; Li, Xiaokun; Shi, Zhan; Zhou, Hao; Feng, Tianli; Xu, Biao (June 2023, Nano Letters)

Full Text Available
Leveraging Domain Information for the Efficient Automated Design of Deep Learning Accelerators

https://doi.org/10.1109/HPCA56546.2023.10071095

Sakhuja, Chirag; Shi, Zhan; Lin, Calvin (February 2023, International Symposium on High Performance Computer Architecture)

Deep learning accelerators are important tools for feeding the growing demand for deep learning applications. The automated design of such accelerators--which is important for reducing development costs--can be viewed as a search over a vast and complex design space that consists of all possible accelerators and all the possible software that could run on them. Unfortunately, this search is complicated by the existence of many ordinal and categorical values, which are critical to explore for the ultimate design but are not handled well by existing search techniques. This paper presents a technique for efficiently searching this space by injecting domain information--in this case information about hardware/software (HW/SW) co-design--into the automated search process. Specifically, this paper introduces a novel Bayesian optimization framework called daBO (domain-aware BO) that accepts domain information as input, including those describing ordinal and categorical values. This paper also introduces Spotlight, a design tool based on daBO, and this paper empirically shows that Spotlight produces accelerator designs and software schedules that are orders of magnitude better than those created by the state-of-the-art. For example, for the ResNet-50 deep learning model, Spotlight produces a HW/SW configuration that reduces delay by 135x over the configuration produced by ConfuciuX, a state-of-the-art HW/SW co-design tool, and Spotlight reduces energy-delay product (EDP) by 44x over an Eyeriss-like accelerator, which is an edge-scale hand-designed accelerator. In the realm of cloud-scale accelerators, Spotlight reduces the EDP of a scaled-up Eyeriss-like accelerator by 23x. Our evaluation shows that Spotlight benefits from the efficiency of daBO, which allows Spotlight to identify accelerator designs and software schedules that prior work cannot identify.
Full Text Available
Distributionally Robust Structure Learning for Discrete Pairwise Markov Networks

Li, Yeshu; Shi, Zhan; Zhang, Xinhua; Ziebart, Brian (March 2022, International Conference on Artificial Intelligence and Statistics)

We consider the problem of learning the underlying structure of a general discrete pairwise Markov network. Existing approaches that rely on empirical risk minimization may perform poorly in settings with noisy or scarce data. To overcome these limitations, we propose a computationally efficient and robust learning method for this problem with near-optimal sample complexities. Our approach builds upon distributionally robust optimization (DRO) and maximum conditional log-likelihood. The proposed DRO estimator minimizes the worst-case risk over an ambiguity set of adversarial distributions within bounded transport cost or f-divergence of the empirical data distribution. We show that the primal minimax learning problem can be efficiently solved by leveraging sufficient statistics and greedy maximization in the ostensibly intractable dual formulation. Based on DRO’s approximation to Lipschitz and variance regularization, we derive near-optimal sample complexities matching existing results. Extensive empirical evidence with different corruption models corroborates the effectiveness of the proposed methods.
Full Text Available
Graph-based active learning for semi-supervised classification of SAR data

https://doi.org/10.1117/12.2618847

Miller, Kevin; Mauro, Jack; Setiadi, Jason; Baca, Xoaquin; Shi, Zhan; Calder, Jeff; Bertozzi, Andrea (May 2022, SPIE Defense and Commercial Sensing: Algorithms for Synthetic Aperture Radar Imagery XXIX)
Zelnio, Edmund; Garber, Frederick D. (Ed.)
Full Text Available
A Hierarchical Neural Model of Data Prefetching

https://doi.org/10.1145/3445814.3446752

Shi, Zhan; Jain, Akanksha; Swersky, Kevin; Hashemi, Milad; Ranganathan, Parthasarathy; Lin, Calvin (April 2021, International Conference on Architectural Support for Programming Languages and Operating Systems)

This paper presents Voyager, a novel neural network for data prefetching. Unlike previous neural models for prefetching, which are limited to learning delta correlations, our model can also learn address correlations, which are important for prefetching irregular sequences of memory accesses. The key to our solution is its hierarchical structure that separates addresses into pages and offsets and that introduces a mechanism for learning important relations among pages and offsets. Voyager provides significant prediction benefits over current data prefetchers. For a set of irregular programs from the SPEC 2006 and GAP benchmark suites, Voyager sees an average IPC improvement of 41.6% over a system with no prefetcher, compared with 21.7% and 28.2%, respectively, for idealized Domino and ISB prefetchers. We also find that for two commercial workloads for which current data prefetchers see very little benefit, Voyager dramatically improves both accuracy and coverage. At present, slow training and prediction preclude neural models from being practically used in hardware, but Voyager’s overheads are significantly lower, in every dimension, than those of previous neural models. For example, computation cost is reduced by 15-20×, and storage overhead is reduced by 110-200×. Thus, Voyager represents a significant step towards a practical neural prefetcher.
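The page/offset separation mentioned in the abstract above can be illustrated with a minimal sketch. This is not Voyager's code: the 4 KiB page size, the 64-byte cache line, and the helper names `split_address` and `join_address` are assumptions chosen for illustration, and the abstract does not state which granularity the model uses.

```python
# Hypothetical illustration of splitting a memory address into a page and
# an offset. The sizes below are assumptions, not values from the abstract.
PAGE_BITS = 12  # assumption: 4 KiB pages
LINE_BITS = 6   # assumption: 64-byte cache lines
OFFSET_BITS = PAGE_BITS - LINE_BITS  # cache-line slots per page, as a bit count


def split_address(addr: int) -> tuple[int, int]:
    """Return (page, offset), where offset is the cache-line index in the page."""
    line = addr >> LINE_BITS                   # drop the byte-within-line bits
    page = line >> OFFSET_BITS                 # high bits identify the page
    offset = line & ((1 << OFFSET_BITS) - 1)   # low bits identify the line in the page
    return page, offset


def join_address(page: int, offset: int) -> int:
    """Rebuild the cache-line-aligned address from a (page, offset) pair."""
    return ((page << OFFSET_BITS) | offset) << LINE_BITS


page, offset = split_address(0x12345678)
print(hex(page), offset)
print(hex(join_address(page, offset)))
```

With these assumed sizes there are only 64 possible offsets per page, which suggests why treating pages and offsets as two separate, smaller vocabularies is more tractable than treating every full address as its own symbol.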
Full Text Available
Generalised Lipschitz Regularisation Equals Distributional Robustness

Cranko, Zac; Shi, Zhan; Zhang, Xinhua; Nock, Richard; Kornblith, Simon (January 2021, International Conference on Machine Learning (ICML))
Full Text Available
Installation of synergistic binding sites onto porous organic polymers for efficient removal of perfluorooctanoic acid

https://doi.org/10.1038/s41467-022-29816-1

Liu, Xiongli; Zhu, Changjia; Yin, Jun; Li, Jixin; Zhang, Zhiyuan; Li, Jinli; Shui, Feng; You, Zifeng; Shi, Zhan; Li, Baiyan; et al (April 2022, Nature Communications)

Herein, we report a strategy to construct highly efficient perfluorooctanoic acid (PFOA) adsorbents by installing synergistic electrostatic/hydrophobic sites onto porous organic polymers (POPs). The constructed model material, PAF-1-NDMB (NDMB = N,N-dimethyl-butylamine), demonstrates an exceptionally high PFOA uptake capacity of over 2000 mg g⁻¹, a 14.8-fold enhancement over its parent material, PAF-1. This capacity is 32.0 and 24.1 times higher than those of the benchmark materials DFB-CDP (β-cyclodextrin (β-CD)-based polymer network) and activated carbon, respectively, under the same conditions. Furthermore, PAF-1-NDMB exhibits the highest k₂ value, 24,000 g mg⁻¹ h⁻¹, among all reported PFOA sorbents. It can also remove 99.99% of PFOA, from 1000 ppb to <70 ppt, within 2 min, bringing the concentration below the advisory level of the United States Environmental Protection Agency. This work thus not only provides a generic approach for constructing PFOA adsorbents, but also develops POPs as a platform for PFOA capture.
Certified Robustness of Graph Convolution Networks for Graph Classification under Topological Attacks

Jin, Hongwei; Shi, Zhan; Peruri, Ashish; Zhang, Xinhua (January 2020, Advances in neural information processing systems)
Graph convolution networks (GCNs) have become effective models for graph classification. Similar to many deep networks, GCNs are vulnerable to adversarial attacks on graph topology and node attributes. Recently, a number of effective attack and defense algorithms have been designed, but no certificate of robustness has been developed for GCN-based graph classification under topological perturbations with both local and global budgets. In this paper, we propose the first certificate for this problem. Our method is based on Lagrange dualization and convex envelope, which result in tight approximation bounds that are efficiently computable by dynamic programming. When used in conjunction with robust training, it allows an increased number of graphs to be certified as robust.
Full Text Available
Transparent, Flexible, Penetrating Microelectrode Arrays with Capabilities of Single‐Unit Electrophysiology

https://doi.org/10.1002/adbi.201800276

Seo, Kyung Jin; Artoni, Pietro; Qiang, Yi; Zhong, Yiding; Han, Xun; Shi, Zhan; Yao, Wenhao; Fagiolini, Michela; Fang, Hui (January 2019, Advanced Biosystems)
Full Text Available
